Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27.135
Filtrar
1.
Methods Mol Biol ; 2787: 183-197, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38656490

RESUMO

PacBio long-read sequencing is a third-generation technology that generates long reads up to 20 kilobases (kb), unlike short-read sequencing instruments that produce up to 600 bases. Long-read sequencing is particularly advantageous in higher organisms, such as humans and plants, where repetitive regions in the genome are more abundant. The PacBio long-read sequencing uses a single molecule, real-time approach where the SMRT cells contain several zero-mode waveguides (ZMWs). Each ZMW contains a single DNA molecule bound by a DNA polymerase. All ZMWs are flushed with deoxy nucleotides with a fluorophore specific to each nucleotide. As the sequencing proceeds, the detector detects the wavelength of the fluorescence and the nucleotides are read in real-time. This chapter describes the sample and library preparation for PacBio long-read sequencing for grapevine.


Assuntos
Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Vitis , Vitis/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , DNA de Plantas/genética , Genoma de Planta
2.
BMC Bioinformatics ; 25(1): 154, 2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38637756

RESUMO

BACKGROUND: High-throughput sequencing is a powerful tool that is extensively applied in biological studies. However, sequencers may produce low-quality bases, leading to ambiguous bases, 'N's. PCR duplicates introduced in library preparation are conventionally removed in genomics studies, and several deduplication tools have been developed for this purpose. Two identical reads may appear different due to ambiguous bases and the existing tools cannot address 'N's correctly or efficiently. RESULTS: Here we proposed and implemented TrieDedup, which uses the trie (prefix tree) data structure to compare and store sequences. TrieDedup can handle ambiguous base 'N's, and efficiently deduplicate at the level of raw sequences. We also reduced its memory usage by approximately 20% by implementing restrictedDict in Python. We benchmarked the performance of the algorithm and showed that TrieDedup can deduplicate reads up to 270-fold faster than pairwise comparison at a cost of 32-fold higher memory usage. CONCLUSIONS: The TrieDedup algorithm may facilitate PCR deduplication, barcode or UMI assignment, and repertoire diversity analysis of large-scale high-throughput sequencing datasets with its ultra-fast algorithm that can account for ambiguous bases due to sequencing errors.


Assuntos
Algoritmos , Software , Genômica , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA
3.
Nat Commun ; 15(1): 3447, 2024 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-38658554

RESUMO

Achieving cost-competitive bio-based processes requires development of stable and selective biocatalysts. Their realization through in vitro enzyme characterization and engineering is mostly low throughput and labor-intensive. Therefore, strategies for increasing throughput while diminishing manual labor are gaining momentum, such as in vivo screening and evolution campaigns. Computational tools like machine learning further support enzyme engineering efforts by widening the explorable design space. Here, we propose an integrated solution to enzyme engineering challenges whereby ML-guided, automated workflows (including library generation, implementation of hypermutation systems, adapted laboratory evolution, and in vivo growth-coupled selection) could be realized to accelerate pipelines towards superior biocatalysts.


Assuntos
Biocatálise , Engenharia de Proteínas , Engenharia de Proteínas/métodos , Enzimas/metabolismo , Enzimas/genética , Enzimas/química , Aprendizado de Máquina , Evolução Molecular Direcionada/métodos , Automação , Biblioteca Gênica
4.
Appl Microbiol Biotechnol ; 108(1): 305, 2024 Apr 21.
Artigo em Inglês | MEDLINE | ID: mdl-38643427

RESUMO

Non-equilibrium (NEQ) alchemical free energy calculations are an emerging tool for accurately predicting changes in protein folding free energy resulting from amino acid mutations. In this study, this method in combination with the Rosetta ddg monomer tool was applied to predict more thermostable variants of the polyethylene terephthalate (PET) degrading enzyme DuraPETase. The Rosetta ddg monomer tool efficiently enriched promising mutations prior to more accurate prediction by NEQ alchemical free energy calculations. The relative change in folding free energy of 96 single amino acid mutations was calculated by NEQ alchemical free energy calculation. Experimental validation of ten of the highest scoring variants identified two mutations (DuraPETaseS61M and DuraPETaseS223Y) that increased the melting temperature (Tm) of the enzyme by up to 1 °C. The calculated relative change in folding free energy showed an excellent correlation with experimentally determined Tm resulting in a Pearson's correlation coefficient of r = - 0.84. Limitations in the prediction of strongly stabilizing mutations were, however, encountered and are discussed. Despite these challenges, this study demonstrates the practical applicability of NEQ alchemical free energy calculations in prospective enzyme engineering projects. KEY POINTS: • Rosetta ddg monomer enriches stabilizing mutations in a library of DuraPETase variants • NEQ free energy calculations accurately predict changes in Tm of DuraPETase • The DuraPETase variants S223Y, S42M, and S61M have increased Tm.


Assuntos
Aminoácidos , Polietilenotereftalatos , Estudos Prospectivos , Biblioteca Gênica , Mutação
5.
Methods Mol Biol ; 2757: 185-200, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38668967

RESUMO

Long-read sequencing has proven the necessity for high-quality genomic assemblies of reference species, including enigmatic ctenophores. Obtaining high-molecular-weight genomic DNA is pivotal to this process and has proven highly problematic for many species. Here, we discuss different methodologies for gDNA isolation and present a protocol for isolating gDNA for several members of the phylum Ctenophora. Specifically, we describe a Pacific Biosciences library construction method used in conjunction with gDNA isolation methods that have proven successful in obtaining high-quality genomic assemblies in ctenophores.


Assuntos
Ctenóforos , DNA , Genômica , Análise de Sequência de DNA , Animais , Ctenóforos/genética , Genômica/métodos , DNA/genética , DNA/isolamento & purificação , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biblioteca Gênica , Genoma/genética
6.
Methods Mol Biol ; 2757: 201-214, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38668968

RESUMO

RNA-seq or transcriptome analysis of individual cells and small cell populations is essential for virtually any biomedical field. Here, we examine and discuss the different methods of RNA isolation specific to ctenophores. We present a convenient, inexpensive, and reproducible protocol for RNA-seq libraries that are designed for low quantities of samples. We demonstrated these methods on early (one, two, four, eight cells) embryonic and developmental stages, tissues, and even a single aboral organ from the ctenophore Pleurobrachia bachei and other ctenophore species (e.g., Mnemiopsis, Bolinopsis, and Beroe).


Assuntos
Ctenóforos , RNA , Animais , Ctenóforos/genética , RNA/genética , RNA/isolamento & purificação , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , RNA-Seq/métodos , Transcriptoma/genética , Análise de Sequência de RNA/métodos
7.
Methods Mol Biol ; 2757: 289-306, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38668973

RESUMO

The functional screening of cDNA libraries (or functional cloning) enables isolation of cDNA genes encoding novel proteins with unknown amino acid sequences. This approach is the only way to identify a protein sequence in the event of shortage of biological material for obtaining pure target protein in amounts sufficient to determine its primary structure, since sensitive functional test for a target protein is only required to successfully perform functional cloning. Commonly, bioluminescent proteins from representatives belonging to different taxa significantly differ in sequences due to independent origin of bioluminescent systems during evolution. Nonetheless, these proteins are frequently similar in functions and can use even the same substrate of bioluminescence reaction, allowing the use of the same functional test for screening. The cDNA genes encoding unknown light-emitting proteins can be identified during functional screening with high sensitivity, which is provided by modern light recording equipment making possible the detection of a very small amount of a target protein. Here, we present the protocols for isolation of full-size cDNA genes for the novel bioluminescent protein family of light-sensitive Ca2+-regulated photoproteins in the absence of any sequence information by functional screening of plasmid cDNA expression library. The protocols describe all the steps from gathering animals to isolation of individual E. coli colonies carrying full-size cDNA genes using photoprotein berovin from ctenophore Beroe abyssicola as an illustrative example.


Assuntos
Clonagem Molecular , Ctenóforos , DNA Complementar , Biblioteca Gênica , Proteínas Luminescentes , Animais , Ctenóforos/genética , Ctenóforos/metabolismo , Clonagem Molecular/métodos , Proteínas Luminescentes/genética , Proteínas Luminescentes/metabolismo , DNA Complementar/genética , Escherichia coli/genética , Escherichia coli/metabolismo
8.
Bioinformatics ; 40(4)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38569896

RESUMO

MOTIVATION: Long-read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. RESULTS: Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or nonunique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues. AVAILABILITY AND IMPLEMENTATION: Pacybara, freely available at https://github.com/rothlab/pacybara, is implemented using R, Python, and bash for Linux. It runs on GNU/Linux HPC clusters via Slurm, PBS, or GridEngine schedulers. A single-machine simplex version is also available.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Software , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biblioteca Gênica , Genótipo , Análise por Conglomerados
9.
BMC Genomics ; 25(1): 364, 2024 Apr 13.
Artigo em Inglês | MEDLINE | ID: mdl-38615000

RESUMO

Pseudoalteromonas viridis strain BBR56 was isolated from seawater at Dutungan Island, South Sulawesi, Indonesia. Bacterial DNA was isolated using Promega Genomic DNA TM050. DNA purity and quantity were assessed using NanoDrop spectrophotometers and Qubit fluorometers. The DNA library and sequencing were prepared using Oxford Nanopore Technology GridION MinKNOW 20.06.9 with long read, direct, and comprehensive analysis. High accuracy base calling was assessed with Guppy version 4.0.11. Filtlong and NanoPlot were used for filtering and visualizing the FASTQ data. Flye (2.8.1) was used for de novo assembly analysis. Variant calls and consensus sequences were created using Medaka. The annotation of the genome was elaborated by DFAST. The assembled genome and annotation were tested using Busco and CheckM. Herein, we found that the highest similarity of the BBR56 isolate was 98.37% with the 16 S rRNA gene sequence of P. viridis G-1387. The genome size was 5.5 Mb and included chromosome 1 (4.2 Mbp) and chromosome 2 (1.3 Mbp), which encoded 61 pseudogenes, 4 noncoding RNAs, 113 tRNAs, 31 rRNAs, 4,505 coding DNA sequences, 4 clustered regularly interspaced short palindromic repeats, 4,444 coding genes, and a GC content of 49.5%. The sequence of the whole genome of P. viridis BBR56 was uploaded to GenBank under the accession numbers CP072425-CP072426, biosample number SAMN18435505, and bioproject number PRJNA716373. The sequence read archive (SRR14179986) was successfully obtained from NCBI for BBR56 raw sequencing reads. Digital DNA-DNA hybridization results showed that the genome of BBR56 had the potential to be a new species because no other bacterial genomes were similar to the sample. Biosynthetic gene clusters (BGCs) were assessed using BAGEL4 and the antiSMASH bacterial version. The genome harbored diverse BGCs, including genes that encoded polyketide synthase, nonribosomal peptide synthase, RiPP-like, NRP-metallophore, hydrogen cyanide, betalactone, thioamide-NRP, Lant class I, sactipeptide, and prodigiosin. Thus, BBR56 has considerable potential for further exploration regarding the use of its secondary metabolite products in the human and fisheries sectors.


Assuntos
Pseudoalteromonas , Humanos , Pseudoalteromonas/genética , Pseudogenes , Biblioteca Gênica , DNA Bacteriano
10.
Nat Commun ; 15(1): 3199, 2024 Apr 13.
Artigo em Inglês | MEDLINE | ID: mdl-38615009

RESUMO

The increasing availability of experimental and computational protein structures entices their use for function prediction. Here we develop an automated procedure to identify enzymes involved in metabolic reactions by assessing substrate conformations docked to a library of protein structures. By screening AlphaFold-modeled vitamin B6-dependent enzymes, we find that a metric based on catalytically favorable conformations at the enzyme active site performs best (AUROC Score=0.84) in identifying genes associated with known reactions. Applying this procedure, we identify the mammalian gene encoding hydroxytrimethyllysine aldolase (HTMLA), the second enzyme of carnitine biosynthesis. Upon experimental validation, we find that the top-ranked candidates, serine hydroxymethyl transferase (SHMT) 1 and 2, catalyze the HTMLA reaction. However, a mouse protein absent in humans (threonine aldolase; Tha1) catalyzes the reaction more efficiently. Tha1 did not rank highest based on the AlphaFold model, but its rank improved to second place using the experimental crystal structure we determined at 2.26 Å resolution. Our findings suggest that humans have lost a gene involved in carnitine biosynthesis, with HTMLA activity of SHMT partially compensating for its function.


Assuntos
Aldeído Liases , Frutose-Bifosfato Aldolase , Humanos , Animais , Camundongos , Frutose-Bifosfato Aldolase/genética , Catálise , Biblioteca Gênica , Glicina Hidroximetiltransferase/genética , Carnitina , Mamíferos
11.
PLoS One ; 19(4): e0298927, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38625992

RESUMO

INTRODUCTION: Dyadic care, which is the concurrent provision of care for a birthing person and their infant, is an approach that may improve disparities in postnatal health outcomes, but no synthesis of existing dyadic care studies has been conducted. This scoping review seeks to identify and summarize: 1) dyadic care studies globally, in which the birthing person-infant dyad are cared for together, 2) postnatal health outcomes that have been evaluated following dyadic care interventions, and 3) research and practice gaps in the implementation, dissemination, and effectiveness of dyadic care to reduce healthcare disparities. MATERIALS AND METHODS: Eligible studies will (1) include dyadic care instances for the birthing person and infant, and 2) report clinical outcomes for at least one member of the dyad or intervention outcomes. Studies will be excluded if they pertain to routine obstetric care, do not present original data, and/or are not available in English or Spanish. We will search CINAHL, Ovid (both Embase and Medline), Scopus, Cochrane Library, PubMed, Google Scholar, Global Health, Web of Science Core Collection, gray literature, and WHO regional databases. Screening will be conducted via Covidence and data will be extracted to capture the study design, dyad characteristics, clinical outcomes, and implementation outcomes. The risk of bias will be assessed using the Joanna Briggs Institute Critical Appraisal Tool. A narrative synthesis of the study findings will be presented. DISCUSSION: This scoping review will summarize birthing person-infant dyadic care interventions that have been studied and the evidence for their effectiveness. This aggregation of existing data can be used by healthcare systems working to improve healthcare delivery to their patients with the aim of reducing postnatal morbidity and mortality. Areas for future research will also be highlighted. TRAIL REGISTRATION: This review has been registered at Open Science Framework (OSF, https://osf.io/5fs6e/).


Assuntos
Academias e Institutos , Disparidades em Assistência à Saúde , Lactente , Feminino , Gravidez , Criança , Humanos , Bases de Dados Factuais , Biblioteca Gênica , Cuidado do Lactente , Literatura de Revisão como Assunto
12.
BMC Cancer ; 24(1): 490, 2024 Apr 17.
Artigo em Inglês | MEDLINE | ID: mdl-38632528

RESUMO

BACKGROUND: Patients with rheumatologic preexisting autoimmune disease (PAD) have not been enrolled in clinical trials of immune checkpoint inhibitors (ICIs). Therefore, the risks and benefits of ICI therapy in such patients are unclear. Herein, we investigated the safety and efficacy of ICIs in rheumatologic PAD patients through a meta-analysis. METHODS: The PubMed, Cochrane Library, Embase and Web of Science databases were searched for additional studies. We analyzed the following data through Stata software: incidence of total irAEs (TirAEs), rate of flares, incidence of new on-set irAEs, rate of discontinuation, objective response rate (ORR) and disease control rate (DCR). RESULTS: We identified 23 articles including 643 patients with rheumatologic PAD. The pooled incidences of TirAEs, flares and new-onset irAEs were 64% (95% CI 55%-72%), 41% (95% CI 31%-50%), and 33% (95% CI 28%-38%), respectively. In terms of severity, the incidences were 7% (95% CI 2%-14%) for Grade 3-4 flares and 12% (95% CI 9%-15%) for Grade 3-4 new-onset irAEs. Patients with RA had a greater risk of flares than patients with other rheumatologic PADs did (RR = 1.35, 95% CI 1.03-1.77). The ORR and DCR were 30% and 44%, respectively. Baseline anti-rheumatic treatment was not significantly associated with the frequency of flares (RR = 1.05, 95% CI 0.63-1.77) or the ORR (RR = 0.45, 95% CI 0.12-1.69). CONCLUSIONS: Patients with rheumatologic PAD, particularly those with RA, are susceptible to relapse of their rheumatologic disease following ICI therapy. ICIs are also effective for treating rheumatologic PAD patients. PROSPECTIVE REGISTER OF SYSTEMATIC REVIEWS (PROSPERO): number CRD 42,023,439,702.


Assuntos
Artrite Reumatoide , Doenças Autoimunes , Neoplasias , Humanos , Inibidores de Checkpoint Imunológico , Bases de Dados Factuais , Biblioteca Gênica
13.
Genome Biol Evol ; 16(4)2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38597156

RESUMO

De novo genes emerge from previously noncoding stretches of the genome. Their encoded de novo proteins are generally expected to be similar to random sequences and, accordingly, with no stable tertiary fold and high predicted disorder. However, structural properties of de novo proteins and whether they differ during the stages of emergence and fixation have not been studied in depth and rely heavily on predictions. Here we generated a library of short human putative de novo proteins of varying lengths and ages and sorted the candidates according to their structural compactness and disorder propensity. Using Förster resonance energy transfer combined with Fluorescence-activated cell sorting, we were able to screen the library for most compact protein structures, as well as most elongated and flexible structures. We find that compact de novo proteins are on average slightly shorter and contain lower predicted disorder than less compact ones. The predicted structures for most and least compact de novo proteins correspond to expectations in that they contain more secondary structure content or higher disorder content, respectively. Our experiments indicate that older de novo proteins have higher compactness and structural propensity compared with young ones. We discuss possible evolutionary scenarios and their implications underlying the age-dependencies of compactness and structural content of putative de novo proteins.


Assuntos
Dobramento de Proteína , Proteínas , Humanos , Proteínas/genética , Estrutura Secundária de Proteína , Biblioteca Gênica
14.
Sci Rep ; 14(1): 8159, 2024 04 08.
Artigo em Inglês | MEDLINE | ID: mdl-38589623

RESUMO

Whole-genome sequencing (WGS) is currently making its transition from research tool into routine (clinical) diagnostic practice. The workflow for WGS includes the highly labor-intensive library preparations (LP), one of the most critical steps in the WGS procedure. Here, we describe the automation of the LP on the flowbot ONE robot to minimize the risk of human error and reduce hands-on time (HOT). For this, the robot was equipped, programmed, and optimized to perform the Illumina DNA Prep automatically. Results obtained from 16 LP that were performed both manually and automatically showed comparable library DNA yields (median of 1.5-fold difference), similar assembly quality values, and 100% concordance on the final core genome multilocus sequence typing results. In addition, reproducibility of results was confirmed by re-processing eight of the 16 LPs using the automated workflow. With the automated workflow, the HOT was reduced to 25 min compared to the 125 min needed when performing eight LPs using the manual workflow. The turn-around time was 170 and 200 min for the automated and manual workflow, respectively. In summary, the automated workflow on the flowbot ONE generates consistent results in terms of reliability and reproducibility, while significantly reducing HOT as compared to manual LP.


Assuntos
Lipopolissacarídeos , Robótica , Humanos , Reprodutibilidade dos Testes , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biblioteca Gênica , Sequenciamento Completo do Genoma , DNA , Fluxo de Trabalho
15.
Molecules ; 29(7)2024 Mar 31.
Artigo em Inglês | MEDLINE | ID: mdl-38611841

RESUMO

The construction of a small molecule library that includes compounds with medium-sized rings is increasingly essential in drug discovery. These compounds are essential for identifying novel therapeutic agents capable of targeting "undruggable" targets through high-throughput and high-content screening, given their structural complexity and diversity. However, synthesizing medium-sized rings presents notable challenges, particularly with direct cyclization methods, due to issues such as transannular strain and reduced degrees of freedom. This review presents an overview of current strategies in synthesizing medium-sized rings, emphasizing innovative approaches like ring-expansion reactions. It highlights the challenges of synthesis and the potential of these compounds to diversify the chemical space for drug discovery, underscoring the importance of medium-sized rings in developing new bioactive compounds.


Assuntos
Descoberta de Drogas , Osteopatia , Biblioteca Gênica , Ciclização
16.
Microb Genom ; 10(4)2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38578268

RESUMO

Background. PCR amplification is a necessary step in many next-generation sequencing (NGS) library preparation methods [1, 2]. Whilst many PCR enzymes are developed to amplify single targets efficiently, accurately and with specificity, few are developed to meet the challenges imposed by NGS PCR, namely unbiased amplification of a wide range of different sizes and GC content. As a result PCR amplification during NGS library prep often results in bias toward GC neutral and smaller fragments. As NGS has matured, optimized NGS library prep kits and polymerase formulations have emerged and in this study we have tested a wide selection of available enzymes for both short-read Illumina library preparation and long fragment amplification ahead of long-read sequencing.We tested over 20 different hi-fidelity PCR enzymes/NGS amplification mixes on a range of Illumina library templates of varying GC content and composition, and find that both yield and genome coverage uniformity characteristics of the commercially available enzymes varied dramatically. Three enzymes Quantabio RepliQa Hifi Toughmix, Watchmaker Library Amplification Hot Start Master Mix (2X) 'Equinox' and Takara Ex Premier were found to give a consistent performance, over all genomes, that mirrored closely that observed for PCR-free datasets. We also test a range of enzymes for long-read sequencing by amplifying size fractionated S. cerevisiae DNA of average size 21.6 and 13.4 kb, respectively.The enzymes of choice for short-read (Illumina) library fragment amplification are Quantabio RepliQa Hifi Toughmix, Watchmaker Library Amplification Hot Start Master Mix (2X) 'Equinox' and Takara Ex Premier, with RepliQa also being the best performing enzyme from the enzymes tested for long fragment amplification prior to long-read sequencing.


Assuntos
DNA , Saccharomyces cerevisiae , Reação em Cadeia da Polimerase/métodos , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos
17.
Int J Mol Sci ; 25(7)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38612639

RESUMO

Single-cell RNA sequencing (scRNA-seq) has emerged as a powerful technique for investigating biological heterogeneity at the single-cell level in human systems and model organisms. Recent advances in scRNA-seq have enabled the pooling of cells from multiple samples into single libraries, thereby increasing sample throughput while reducing technical batch effects, library preparation time, and the overall cost. However, a comparative analysis of scRNA-seq methods with and without sample multiplexing is lacking. In this study, we benchmarked methods from two representative platforms: Parse Biosciences (Parse; with sample multiplexing) and 10x Genomics (10x; without sample multiplexing). By using peripheral blood mononuclear cells (PBMCs) obtained from two healthy individuals, we demonstrate that demultiplexed scRNA-seq data obtained from Parse showed similar cell type frequencies compared to 10x data where samples were not multiplexed. Despite relatively lower cell capture affecting library preparation, Parse can detect rare cell types (e.g., plasmablasts and dendritic cells) which is likely due to its relatively higher sensitivity in gene detection. Moreover, a comparative analysis of transcript quantification between the two platforms revealed platform-specific distributions of gene length and GC content. These results offer guidance for researchers in designing high-throughput scRNA-seq studies.


Assuntos
Benchmarking , Leucócitos Mononucleares , Humanos , Biblioteca Gênica , Genômica , Análise de Sequência de RNA
18.
BMC Genomics ; 25(1): 227, 2024 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-38429743

RESUMO

BACKGROUND: Hybridization capture-based targeted next generation sequencing (NGS) is gaining importance in routine cancer clinical practice. DNA library preparation is a fundamental step to produce high-quality sequencing data. Numerous unexpected, low variant allele frequency calls were observed in libraries using sonication fragmentation and enzymatic fragmentation. In this study, we investigated the characteristics of the artifact reads induced by sonication and enzymatic fragmentation. We also developed a bioinformatic algorithm to filter these sequencing errors. RESULTS: We used pairwise comparisons of somatic single nucleotide variants (SNVs) and insertions and deletions (indels) of the same tumor DNA samples prepared using both ultrasonic and enzymatic fragmentation protocols. Our analysis revealed that the number of artifact variants was significantly greater in the samples generated using enzymatic fragmentation than using sonication. Most of the artifacts derived from the sonication-treated libraries were chimeric artifact reads containing both cis- and trans-inverted repeat sequences of the genomic DNA. In contrast, chimeric artifact reads of endonuclease-treated libraries contained palindromic sequences with mismatched bases. Based on these distinctive features, we proposed a mechanistic hypothesis model, PDSM (pairing of partial single strands derived from a similar molecule), by which these sequencing errors derive from ultrasonication and enzymatic fragmentation library preparation. We developed a bioinformatic algorithm to generate a custom mutation "blacklist" in the BED region to reduce errors in downstream analyses. CONCLUSIONS: We first proposed a mechanistic hypothesis model (PDSM) of sequencing errors caused by specific structures of inverted repeat sequences and palindromic sequences in the natural genome. This new hypothesis predicts the existence of chimeric reads that could not be explained by previous models, and provides a new direction for further improving NGS analysis accuracy. A bioinformatic algorithm, ArtifactsFinder, was developed and used to reduce the sequencing errors in libraries produced using sonication and enzymatic fragmentation.


Assuntos
Artefatos , Genoma Humano , Humanos , Biblioteca Gênica , Análise de Sequência de DNA/métodos , DNA de Neoplasias , Sequenciamento de Nucleotídeos em Larga Escala/métodos
19.
Methods Mol Biol ; 2776: 243-257, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38502509

RESUMO

Global understanding of plastid gene expression has always been impaired by its complexity. RNA splicing, editing, and intercistronic processing create multiple transcripts isoforms that can hardly be resolved using traditional molecular biology techniques. During the last decade, the wide adoption of RNA-seq-based techniques has, however, allowed an unprecedented understanding of all the different steps of chloroplast gene expression, from transcription to translation. Current strategies are nonetheless unable to identify and quantify full length transcripts isoforms, a limitation that can now be overcome using Nanopore Sequencing. We here provide a complete protocol to produce, from total leaf RNA, cDNA libraries ready for Nanopore sequencing. While most Nanopore protocols take advantage of the mRNA polyA tail we here first ligate an RNA adapter to the 3' ends of the RNAs and use it to initiate the template switching reverse transcription. The cDNA is then prepared and indexed for use with the regular Oxford Nanopore v14 chemistry. This protocol is of particular interest to researchers willing to simultaneously study the multiple post-transcriptional processes prevalent in the chloroplast.


Assuntos
Sequenciamento por Nanoporos , Transcriptoma , Sequenciamento por Nanoporos/métodos , Biblioteca Gênica , RNA/genética , Isoformas de Proteínas/genética , Cloroplastos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos
20.
Biotechniques ; 76(5): 216-223, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38530148

RESUMO

Ancient DNA (aDNA) obtained from human remains is typically fragmented and present in relatively low amounts. Here we investigate a set of optimal methods for producing aDNA data by comparing silica-based DNA extraction and aDNA library preparation protocols. We also test the efficiency of whole-genome enrichment (WGC) on ancient human samples by modifying a number of parameter combinations. We find that the Dabney extraction protocol performs significantly better than alternatives. We further observed a positive trend with the BEST library protocol indicating lower clonality. Notably, our results suggest that WGC is effective at retrieving endogenous DNA, particularly from poorly-preserved human samples, by increasing human endogenous proportions by 5x. Thus, aDNA studies will be most likely to benefit from our results.


Assuntos
DNA Antigo , Genoma Humano , DNA Antigo/análise , DNA Antigo/isolamento & purificação , Humanos , Genoma Humano/genética , Biblioteca Gênica , Análise de Sequência de DNA/métodos , Dióxido de Silício/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...